Search CORE

379 research outputs found

Discovering Restricted Regular Expressions with Interleaving

Author: A. Ignatiev
E.M. Gold
G.J. Bex
J. Pei
R.W. Bailey
S. Tsukiyama
Publication venue
Publication date: 01/04/2015
Field of study

Discovering a concise schema from given XML documents is an important problem in XML applications. In this paper, we focus on the problem of learning an unordered schema from a given set of XML examples, which is actually a problem of learning a restricted regular expression with interleaving using positive example strings. Schemas with interleaving could present meaningful knowledge that cannot be disclosed by previous inference techniques. Moreover, inference of the minimal schema with interleaving is challenging. The problem of finding a minimal schema with interleaving is shown to be NP-hard. Therefore, we develop an approximation algorithm and a heuristic solution to tackle the problem using techniques different from known inference algorithms. We do experiments on real-world data sets to demonstrate the effectiveness of our approaches. Our heuristic algorithm is shown to produce results that are very close to optimal.Comment: 12 page

arXiv.org e-Print Archive

Crossref

Active learning of group-structured environments

Author: E.M. Gold
F. Stephan
H. Jaeger
J.C. Culberson
J.J. Rothman
W.W. Boone
Publication venue: 'Springer Fachmedien Wiesbaden GmbH'
Publication date: 01/01/2008
Field of study

The question investigated in this paper is to what extent an input representation influences the success of learning, in particular from the point of view of analyzing agents that can interact with their environment. We investigate learning environments that have a group structure. We introduce a learning model in different variants and study under which circumstances group structures can be learned efficiently from experimenting with group generators (actions). Negative results are presented, even without efficiency constraints, for rather general classes of groups showing that even with group structure, learning an environment from partial information is far from trivial. However, positive results for special subclasses of Abelian groups turn out to be a good starting point for the design of efficient learning algorithms based on structured representations

CiteSeerX

Crossref

SZTAKI Publication Repository

Early changes in sagittal plane knee biomechanics after total knee arthroplasty

Author: Debbi E.M.
Bernfeld B.
Gray E.
Salai M.
Levy Y.
Gold A.
Debi R.
Wolf A.
Publication venue: Published by Elsevier Ltd.
Publication date: 01/01/2008
Field of study

Elsevier - Publisher Connector

Crossref

Repositori Obert de Coneixement de l'Ajuntament de Barcelona

Learning stochastic finite automata from experts

Author: D. Angluin
E.M. Gold
E.M. Gold
K. Lari
N. Abe
P. García
T. Goan
W. Hoeffding
Y. Sakakibara
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref

Searching for Leptoquarks in electron-photon Collisions

Author: Abbott
Abbott
Abramowicz
Arutyunian
Blumlein
Buchmüller
Buchmüller
de Montigny
Dimopoulos
Dobado
Dobado
Drees
Duke
E.M. Gregores
Ellis
Farhi
Gold
Gunion
Hewett
Hewett
Hewett
Hewett
Langacker
M.B. Magro
Milburn
Nadeau
O.J.P. Éboli
P.G. Mercadante
S.F. Novaes
Telnov
Witten
Wudka
Yehudai
Éboli
Éboli
Publication venue: 'Elsevier BV'
Publication date: 01/01/1993
Field of study

We study the production of composite scalar leptoquarks in

e\gamma

colliders, and we show that an

e^+e^-

machine operating in its

e\gamma

mode is the best way to look for these particles in

e^+e^-

collisions, due to the hadronic content of the photon.Comment: 12 pages in REVTeX3. 6 figures appended as postcript files. Report: IFT-P.014/93 and IFUSP-P 104

arXiv.org e-Print Archive

Crossref

CERN Document Server

Hitting all Maximal Independent Sets of a Bipartite Graph

Author: C. Wrathall
C.H. Papadimitriou
D. Duffus
D. Duffus
D. Marx
E.M. Gold
G. Bacsó
G. Durán
G. Durán
Gwenaël Joret
J. Kratochvíl
Jean Cardinal
L.J. Stockmeyer
M. Schaefer
P. Erdös
T. Andreae
T. Andreae
V. Balachandran
V. Guruswami
Z. Lonc
Z. Tuza
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 27/12/2013
Field of study

We prove that given a bipartite graph G with vertex set V and an integer k, deciding whether there exists a subset of V of size k hitting all maximal independent sets of G is complete for the class Sigma_2^P.Comment: v3: minor chang

arXiv.org e-Print Archive

CiteSeerX

Crossref

DI-fusion

Variable length-based genetic representation to automatically evolve wrappers

Author: B. Hutt
C.L. Ramsey
D. Camacho
D. Chu
D. Goldberg
D.F. Barrero
D.S. Burke
E.M. Gold
J.E.F. Friedl
J.G. Brookshear
J.H. Holland
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2010
Field of study

The final publication is available at Springer via http://dx.doi.org/10.1007/978-3-642-12433-4_44Proceedings 8th International Conference on Practical Applications of Agents and Multiagent SystemsThe Web has been the star service on the Internet, however the outsized information available and its decentralized nature has originated an intrinsic difficulty to locate, extract and compose information. An automatic approach is required to handle with this huge amount of data. In this paper we present a machine learning algorithm based on Genetic Algorithms which generates a set of complex wrappers, able to extract information from theWeb. The paper presents the experimental evaluation of these wrappers over a set of basic data sets.This work has been partially supported by the Spanish Ministry of Science and Innovation under the projects Castilla-La Mancha project PEII09-0266-6640, COMPUBIODIVE (TIN2007-65989), and by V-LeaF (TIN2008-02729-E/TIN)

Crossref

LAReferencia - Red Federada de Repositorios Institucionales de Publicaciones Científicas Latinoamericanas

Biblos-e Archivo

Mining State-Based Models from Proof Corpora

Author: A. Biermann
A. Bundy
D. Kühlwein
E.M. Gold
F. Wiedijk
G. Gonthier
G. Gonthier
G. Grov
J. Alama
J. Heras
J. Heras
J. Meng
K.J. Lang
K.T. Cheng
L. Dixon
L.C. Paulson
M. Hall
M. Jamnik
N. Walkinshaw
N. Walkinshaw
S. Böhme
T. Nipkow
X. Leroy
Publication venue
Publication date: 01/01/2014
Field of study

Interactive theorem provers have been used extensively to reason about various software/hardware systems and mathematical theorems. The key challenge when using an interactive prover is finding a suitable sequence of proof steps that will lead to a successful proof requires a significant amount of human intervention. This paper presents an automated technique that takes as input examples of successful proofs and infers an Extended Finite State Machine as output. This can in turn be used to generate proofs of new conjectures. Our preliminary experiments show that the inferred models are generally accurate (contain few false-positive sequences) and that representing existing proofs in such a way can be very useful when guiding new ones.Comment: To Appear at Conferences on Intelligent Computer Mathematics 201

arXiv.org e-Print Archive

CiteSeerX

Crossref

Leicester Research Archive

A case study on grammatical-based representation for regular expression evolution

Author: A.E. Eiben
B.D. Dunay
D.F. Barrero
E.M. Gold
G. Zipf
J.E.F. Friedl
K. Thompson
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2010
Field of study

The final publication is available at Springer via http://dx.doi.org/10.1007/978-3-642-12433-4_45Proceedings of 8th International Conference on Practical Applications of Agents and Multiagent SystemsRegular expressions, or simply regex, have been widely used as a powerful pattern matching and text extractor tool through decades. Although they provide a powerful and flexible notation to define and retrieve patterns from text, the syntax and the grammatical rules of these regex notations are not easy to use, and even to understand. Any regex can be represented as a Deterministic or Non-Deterministic Finite Automata; so it is possible to design a representation to automatically build a regex, and a optimization algorithm able to find the best regex in terms of complexity. This paper introduces both, a graph-based representation for regex, and a particular heuristic-based evolutionary computing algorithm based on grammatical features from this language in a particular data extraction problem.This work has been partially supported by the Spanish Ministry of Science and Innovation under the projects Castilla-La Mancha project PEII09-0266-6640, COMPUBIODIVE (TIN2007-65989), and by HADA (TIN2007-64718)

Crossref

LAReferencia - Red Federada de Repositorios Institucionales de Publicaciones Científicas Latinoamericanas

Biblos-e Archivo

Learning Rational Functions

Author: C. Choffrut
C. Choffrut
C. Higuera de la
C. Reutenauer
C.C. Elgot
E.M. Gold
J. Carme
J. Engelfriet
J. Engelfriet
J. Högberg
J. Oncina
J. Oncina
S. Friese
Publication venue: HAL CCSD
Publication date: 01/01/2012
Field of study

International audienceRational functions are transformations from words to words that can be defined by string transducers. Rational functions are also captured by deterministic string transducers with lookahead. We show for the first time that the class of rational functions can be learned in the limit with polynomial time and data, when represented by string transducers with lookahead in the diagonal-minimal normal form that we introduce

HAL - Lille 3

Crossref

INRIA a CCSD electronic archive server